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Abstract 

Recently,  Sherman  et  al.  [14]  analyzed  an  M/G/l  retrial  queueing  model  in  which  customers 
are  forced  to  retry  their  service  if  interrupted  by  a  server  failure.  Using  classical  techniques, 
they  provided  a  stability  analysis,  queue  length  distributions,  key  performance  parameters,  and 
stochastic  decomposition  results.  We  analyze  the  system  under  a  static  Bernoulli  routing  policy 
that  routes  a  proportion  of  arriving  customers  directly  to  the  orbit  when  the  server  is  busy 
or  failed.  In  addition  to  providing  the  key  performance  parameters,  we  show  that  this  system 
exhibits  a  dual  stability  structure,  and  we  characterize  the  optimal  Bernoulli  routing  policy  that 
minimizes  the  total  expected  holding  costs  per  unit  time. 
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1  Introduction 


As  a  model  for  streaming  multimedia  applications,  Sherman,  et  al.  [14]  recently  analyzed  an 
M/G/l  retrial  queueing  system  with  an  unreliable  server,  infinite-capacity  retrial  queue  (or  orbit), 
and  an  infinite-capacity  primary  queue.  In  that  model,  a  customer  in  service  is  forced  to  join 
the  orbit  if  the  server  fails  during  his/her  service  cycle.  Customers  sent  to  the  orbit  persistently 
and  independently  retry  the  server  at  random  intervals  until  their  service  is  complete,  but  they 
can  only  regain  access  to  the  server  if  it  is  up  and  idle  at  the  time  of  a  retrial  attempt.  Primary 
customer  arrivals  who  find  the  server  busy  or  failed  always  join  the  FIFO  primary  queue,  whereas 
those  who  find  the  server  up  and  idle  are  served  to  completion  if  their  service  is  not  interrupted 
by  a  server  failure.  Using  the  method  of  supplementary  variables,  they  established  the  existence 
of  dual  stability  conditions,  one  for  the  orbit  and  one  for  the  primary  queue,  and  derived  the 
probability  generating  functions  of  the  primary  queue  length  and  orbit  size,  as  well  as  the  mean 
performance  parameters.  They  also  demonstrated  that  the  orbit  size  and  system  size  exhibit  a 
stochastic  decomposition  property.  Their  model  assumes  that  arriving  customers  join  the  primary 
queue  by  default  if  the  server  is  found  busy  or  failed.  However,  in  many  engineering  applications, 
it  is  possible  to  mitigate  congestion  in  the  primary  queue  by  statically  or  dynamically  routing 
some  arriving  customers  directly  to  the  orbit  to  retry  their  service  later.  The  primary  aim  of  this 
extended  note  is  to  analyze  the  unreliable  retrial  model  of  [14]  when  a  controller  uses  a  Bernoulli 
routing  policy  to  manage  congestion  in  the  primary  queue. 

Our  model  here  is  motivated  by  a  particular  type  of  internet  protocol  for  streaming  media 
applications  known  as  IP  multicast ,  or  simply  multicast.  Multicast  provides  a  way  to  deliver  a 
single  media  stream  to  a  group  of  users  linked  via  a  local  area  network  (LAN).  For  example,  a 
training  seminar  being  conducted  at  a  particular  location  can  be  streamed  to  groups  of  individuals 
on  a  LAN  at  a  separate,  remote  location.  A  digitized  camera  feed  uploads  the  live  video  to  the 
internet,  and  recipients  participate  in  the  seminar  (though  not  interactively)  through  the  streamed 
video  at  individual  computer  network  terminals.  To  ensure  wider  dissemination  of  the  seminar,  it  is 
often  desirable  to  save  a  complete  copy  of  the  streamed  content  on  a  hard  disk  at  the  remote  location 
for  future  playback.  Because  the  seminar  is  live,  there  is  a  need  to  ensure  timely  transmission  of 
real-time  packets,  even  at  the  expense  of  some  packet  losses.  Packets  may  be  dropped  by  the 
network  administrator  to  relieve  congestion  in  the  primary  packet  transmission  queue,  or  they 
can  be  dropped  if  their  transmission  fails  due  to  packet  corruption,  hardware  failures,  software 
errors,  or  congestion  in  the  local  network  or  the  internet  itself.  However,  dropped  packets  can  be 
retransmitted  later  (when  the  transmission  medium  becomes  free)  so  that  the  complete  seminar 
can  be  “patched-up”  in  the  stored  copy  for  future  playback.  Consequently,  dropped  packets  are  not 
lost  but  are  necessary  to  ensure  a  high-quality  stored  copy.  The  primary  packet  transmission  queue 
mimics  a  1-persistent  carrier-sense  multiple-access  (CSMA)  system.  When  the  oldest  packet  in  the 
primary  queue  detects  that  the  transmission  medium  is  free,  transmission  begins  immediately.  If 
the  communication  medium  fails  during  transmission,  the  packet  is  sent  to  a  retrial  queue  which  is 
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analogous  to  a  non-persistent  CSMA  system.  In  a  non-persistent  system,  packets  do  not  persist  to 
wait  for  a  free  transmission  medium  but  rather  retry  the  transmission  medium  at  random  intervals 
until  it  is  found  up  and  idle.  Therefore,  those  packets  that  are  dropped,  either  upon  arrival  by  the 
network  administrator  or  due  to  a  server  failure,  are  assigned  a  lower  priority  than  the  real-time 
packets  that  enter  the  primary  queue  and  are  time-sensitive.  The  latter  type  corresponds  to  the 
primary  (or  priority)  customers.  Because  dropped  or  interrupted  packets  can  be  retransmitted 
for  inclusion  in  the  stored  copy,  their  transmission  time  is  no  longer  important.  These  packets 
correspond  to  the  retrial  customers  who  enter  the  orbit  and  retry  the  server  until  it  is  found  up 
and  idle.  The  administrator  seeks  to  determine  the  optimal  (static)  proportion  of  arriving  packets 
to  admit  to  the  primary  queue  with  the  objective  of  minimizing  the  total  expected  holding  costs 
per  unit  time  in  the  primary  queue  and  the  orbit. 

While  it  is  well  known  that  dynamic  routing  policies  are  well-suited  for  non-st ationary  regimes 
and  generally  outperform  static  policies,  dynamic  routing  requires  a  great  deal  of  information 
gathering,  storage,  updating,  and  exchange.  By  contrast,  static  routing  can  be  much  faster,  easier 
to  implement,  and  require  far  less  overhead.  Applications  in  computer /communications  networks 
and  service  systems  abound.  For  example,  Cornbee  and  Boxma  [7]  considered  the  allocation  of 
tasks  to  a  finite  number  of  distributed  processors.  In  ATM  networks,  static  routing  algorithms  are 
often  employed  to  balance  the  load  across  computer  network  links  (cf.  Casseti  et  al.  [4]).  In  service 
systems,  Servi  and  Hurnair  [12]  optimized  Bernoulli  routing  probabilities  at  discrete  time  points  to 
balance  the  load  in  large-scale  call  centers  by  estimating  arrival  rates.  Besides  their  relatively  low 
overhead  requirements,  static  policies  can  also  be  used  to  aid  engineers  in  designing  communications 
or  computer  networks  by  providing  performance  bounds  that  can  be  easily  computed  and  evaluated. 

More  generally,  Bernoulli  routing  of  arriving  customers  to  a  finite  group  of  ordinary  (non¬ 
retrial),  homogeneous  queueing  systems  has  been  studied  extensively  in  the  queueing  literature  (cf. 
[5,  10]  and  references  therein).  Under  certain  conditions,  it  can  be  shown  that  the  optimal  Bernoulli 
routing  policy  is  to  assign  an  arriving  customer  to  any  one  of  a  finite  number  of  available  servers 
with  equal  probability.  Relatively  few  researchers  have  considered  routing  policies  for  arriving 
customers  in  retrial  systems.  Choi  and  Park  [6]  analyzed  a  Bernoulli  routing  policy  for  a  system 
that  is  similar  to  the  model  in  [14]  except  that  it  does  not  consider  server  failures.  Atencia  and 
Moreno  [3]  examined  Bernoulli  routing  in  a  model  with  general  (as  opposed  to  exponential)  retrial 
times.  Their  model  permits  only  the  retrial  customer  at  the  head  of  the  line  to  retry  the  server 
which  is  assumed  to  be  reliable.  Liang  and  Kulkarni  [11]  studied  optimal  dynamic  routing  in  a 
retrial  system  in  which  both  retrial  customers  and  primary  arrivals  are  routed  either  to  the  primary 
queue  or  to  the  orbit.  They  proved  the  existence  of  a  threshold- type  policy  that  routes  all  customers 
to  the  primary  queue  up  to  a  threshold  after  which  all  arrivals  are  routed  to  the  orbit.  Their  model, 
however,  does  not  consider  unreliability  of  the  server  which  significantly  complicates  the  analysis. 

In  this  extended  note,  we  revisit  the  queueing  system  analyzed  in  [14]  and  include  static 
Bernoulli  routing  of  arriving  customers  who  find  the  server  busy  or  failed.  Specifically,  a  con- 


2 


trailer  routes  an  arriving  customer  to  the  primary  queue  with  (fixed)  probability  q  and  to  the 
orbit  with  complementary  probability  1  —  q,  independently  of  everything  else,  during  busy  or  failed 
periods.  Our  main  objective  is  to  provide  the  primary  performance  parameters  of  this  system 
and  to  determine  the  optimal  Bernoulli  routing  policy  that  minimizes  the  total  expected  holding 
costs  per  unit  time.  Following  an  analysis  similar  to  the  one  in  [14],  we  (i)  provide  the  necessary 
and  sufficient  overall  stability  condition,  (ii)  derive  the  generating  functions  of  the  primary  queue 
length  and  the  orbit  size  distributions,  (iii)  provide  the  mean  congestion  performance  parameters, 
and  (iv)  determine  the  optimal  Bernoulli  routing  policy.  It  is  shown  that  the  stability  condition 
of  the  overall  system  and  the  steady  state  distribution  of  the  server’s  status  are  insensitive  to  the 
Bernoulli  routing  parameter;  however,  the  routing  parameter  plays  a  crucial  role  in  establishing 
the  stability  region  of  the  primary  queue.  Moreover,  we  provide  sufficient  conditions  to  ensure 
the  existence  of  a  unique,  optimal  Bernoulli  routing  policy  that  minimizes  the  expected  total  cost 
per  unit  time  of  holding  customers  in  the  primary  queue  and  the  orbit.  The  structure  of  the  cost 
function  is  characterized  explicitly  by  the  holding  cost  coefficients,  the  service  time  distribution 
and  the  arrival,  failure,  retrial,  and  repair  rates. 

The  remainder  of  the  paper  is  organized  as  follows.  Section  2  provides  a  complete  model 
description  and  defines  the  key  generating  functions.  In  section  3,  we  state  the  necessary  and 
sufficient  condition  for  overall  system  stability  and  provide  the  generating  functions  of  queue  lengths 
when  the  server  is  idle,  busy,  or  failed.  We  also  provide  expressions  for  the  mean  queue  lengths 
needed  for  the  cost  function.  Section  4  shows  how  our  model  generalizes  a  few  other  M/G/l  retrial 
models  in  the  literature.  Section  5  provides  structural  results  for  the  cost  function  and  characterizes 
the  optimal  Bernoulli  routing  policy,  while  section  6  provides  a  few  illustrative  examples. 

2  Model  Description 

Primary  customers  arrive  to  the  queueing  system  in  accordance  with  a  homogeneous  Poisson 
process  with  rate  A.  A  primary  customer  who  finds  the  server  idle  (and  not  under  repair)  seizes  the 
server  and  is  completed  if  no  server  failures  occur  during  its  service  cycle.  Should  the  server  fail 
during  service,  the  interrupted  customer  joins  a  retrial  queue  (or  orbit)  from  which  it  persistently 
retries  the  server  at  random  intervals  until  access  is  regained.  Customers  who  are  interrupted  must 
repeat  their  service  cycle  (i.e.,  a  preemptive  repeat  discipline  is  employed).  In  contrast  to  the  model 
of  [14] ,  the  system  uses  a  controller  who  diverts  a  proportion  of  arriving  primary  customers  directly 
to  the  retrial  orbit  when  the  server  is  busy  or  failed  to  manage  the  congestion  level  of  the  primary 
queue.  If  a  primary  arrival  finds  the  server  busy  or  under  repair,  the  controller  routes  the  customer 
to  the  primary  queue  with  probability  q  (0  <  q  <  1),  and  to  the  retrial  orbit  with  complementary 
probability  p  =  1  —  q,  independently  of  everything  else  (i.e.,  a  Bernoulli  routing  policy  is  used).  The 
uninterrupted  service  times,  {Sn  :  n  >  1},  are  independent  and  identically  distributed  (i.i.d.)  with 
absolutely  continuous  cumulative  distribution  function  (c.d.f.)  B  and  probability  density  function 
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(p.d.f.)  b.  For  s  >  0,  let 


f 

b*(s)  = 

Jo 


cb{x)  dx 


denote  the  Laplace  transform  of  b.  We  consider  here  both  active  and  idle  failures  of  the  server. 
That  is,  server  failures  occur  according  to  a  Poisson  process  with  rate  £  whenever  the  server  is  idle 
or  busy;  however,  the  server  cannot  fail  when  it  is  under  repair.  The  repair  time  is  assumed  to  be 
exponentially  distributed  with  mean  1/a.  We  pause  here  to  note  that,  while  it  may  be  possible  to 
analyze  the  model  with  general  repair  times  (by  adding  a  supplementary  variable  or  by  embedding  a 
Markov  chain  at  appropriate  epochs),  we  assume  exponential  repair  times  to  maintain  transparency 
of  the  analysis  and  consistency  with  the  model  analyzed  in  [14].  Customers  who  enter  the  retrial 
orbit  (either  by  virtue  of  a  server  failure  or  by  being  routed  to  the  orbit  by  the  controller)  retry  the 
server  directly  at  exponentially-distributed  time  intervals  with  mean  1/6.  The  inter-retrial  times 
are  independent  and  identically  distributed  for  each  customer  in  the  retrial  group.  Moreover,  retrial 
customers  behave  independently  of  one  another,  of  customers  in  the  primary  queue,  and  of  external 
arrivals  to  the  system.  Finally,  customers  in  the  retrial  group  can  gain  access  to  the  server  only  if 
it  is  up  and  idle  at  the  time  of  a  retrial  attempt.  The  arrival,  service,  failure,  repair,  and  retrial 
processes  are  assumed  to  be  mutually  independent. 

As  in  Sherman  et  al.  [14],  we  analyze  this  model  using  classical  techniques,  namely  the  method 
of  supplementary  variables  and  probability  generating  functions.  Adopting  their  notation,  let  Qt 
denote  the  number  of  customers  in  the  primary  queue  at  time  t,  excluding  any  customer  that  might 
be  in  service,  and  let  Rt  denote  the  number  of  customers  in  the  retrial  group  at  time  t.  The  random 
variable  Ut  is  the  occupation  status  of  the  server  given  by 


Ut  = 


if  the  server  is  occupied  at  time  t, 
if  the  server  is  not  occupied  at  time  t, 


while  St  describes  the  operational  status  of  the  server  at  time  t  defined  by 


{1,  if  the  server  is  not  failed  at  time  t, 

0,  if  the  server  is  failed  at  time  t. 

Let  Xt  denote  the  elapsed  service  time  of  the  customer  in  service  at  time  t  so  that  the  continuous¬ 
time  stochastic  process,  {{Qt,  Ut ,  Rt,  St,Xt )  :  t  >  0}  is  a  Markov  process  describing  the  state  of  the 
system.  Further  define  Nt  as  the  total  number  of  customers  in  the  system  at  time  t  (i.e.,  in  orbit, 
in  the  primary  queue,  and  in  service).  Our  primary  aim  is  to  study  the  steady  state  versions  of  Qt, 
Rt  and  Nt  which  we  denote  by  Q,  R,  and  N,  respectively.  Using  these  quantities,  we  will  establish 
conditions  under  which  a  unique  optimal  Bernoulli  routing  parameter  exists.  However,  before  doing 
so,  it  is  of  interest  to  examine  the  influence  of  Bernoulli  routing  on  the  stability  condition,  queue 
length  distributions,  and  performance  parameters  of  the  queueing  system.  To  this  end,  define  for 
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j  >  0,  k  >  0,  and  x  >  0, 

7r0,o,j,i  =  I™  P(Qt  =  0,  Ut  =  0,Rt  =  j ,  St  =  1), 

t— >oo 

7Tfc,o,j,o  =  lim  P(Qt  =  k,  Ut  =  0 ,  Rt  =  j,  St  =  0), 

t— >oo 

=  lim  P(Qt  =  fc,C/t  =  1,-Rt  =  j,  St  =  l,Xt  <  x), 

the  limiting  probabilities  that  the  server  is  idle,  failed,  or  busy,  respectively,  when  there  are  j 
customers  in  the  retrial  group  and  k  customers  in  the  primary  queue.  Next,  define  the  generating 
functions 

OO 

^0,0, lOl)  =  ^2zl ^0,0, j,h  M  <  1, 

3=0 

OO 

4*k, 0,o(^l)  ^  |^1 1  —  1) 

3=0 

oo 

^>k,i,i(x,zi)  =  yy^ijx),  |zii  <  i,  x  >  o. 

3=0 

Here,  0o,o,i("i)  is  the  generating  function  of  R  when  the  server  is  idle,  0k,o,o(zi)  is  the  generating 
function  of  R  when  the  server  is  failed  and  k  customers  are  awaiting  service  in  the  primary  queue, 
and  4>k,i,i(x,  zi)  is  the  generating  function  of  R.  when  the  server  is  busy,  k  customers  are  in  the 
primary  queue,  and  the  elapsed  service  time  of  the  customer  in  service  has  not  exceeded  x.  Further 
define,  respectively, 

OO 

T/VoOi ,z2)  =  yz$(f>ki0to(zi),  |zi|  <  1,  \z2\  <  1, 
k= 0 

00 

V’l ,l(x,Zl,Z2)  =  yZ2(j>k,l,l(x,zl),  \zi\  <  1,  \z2\  <  1, 

k= 0 

the  generating  functions  of  4>k, o,o(zi)  and  4>k,i.i{x,  zi)  with  respect  to  the  primary  queue  size.  The 
joint  generating  function  of  the  orbit  and  primary  queue  size  when  the  server  is  busy  is  given  by 

roo 

^i,i(zi,  z2)  =  /  'ipiii(x,z1,z2)dx. 

Jo 

The  joint  generating  function  of  (R,  Q)  will  be  denoted  by 

OO  OO 

G(zi,z2)  =  E  (zi,z^  =  yy¥(R  =  j.  Q  =  k)  z\  z%,  \zi\  <  1,  \z2\  <  1, 

k= 0 j= 0 

and  the  generating  function  of  the  steady  state  system  size  N  is  given  by 

00 

H(z)  =  E(zN)=yF{N  =  j)zj,  \z\<l. 

3= 0 

In  section  3,  we  provide  the  generating  functions  ^o,o,i(£i)>  '0o,o('zi-  z2),  and  '^\.\{z\ ,  z2)  when 
a  Bernoulli  routing  policy  is  used.  The  relevant  performance  parameters  are  derived  from  these 
results. 
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3  Main  Results 


In  this  section,  we  state  the  necessary  and  sufficient  condition  for  stability  of  the  overall  queueing 
system  and  also  provide  the  generating  functions  of  ( R ,  Q )  and  N.  Central  to  the  analysis  is  the 
notion  of  the  fundamental  server  period  introduced  by  Aissani  and  Artalejo  [1] .  The  fundamental 
server  period  is  the  time  from  which  the  server  initiates  a  new  service  cycle  until  the  next  time  it 
commences  a  new  service  cycle.  Let  Nr  and  Nq  respectively  denote  the  number  of  customers  entering 
the  orbit  and  primary  queue  during  a  fundamental  server  period,  and  let  a(i,j )  =  F(Nr  =  i,Nq  =  j). 
i,j  >  0.  As  before,  we  let 

OO  OO 

Q(zi,z2)  =  ^2^2a(i,j)z[z32,  \zi\  <  1,  \z2\  <  1. 

i= 0  j= 0 


Following  along  the  lines  of  the  proof  of  Theorem  2.1  of  [1] ,  it  can  be  verified  that  the  joint  generating 
function  of  (Nr,  Nq )  is  given  by 


Q(z  i,z2)  =  B(zi,z2 )  + 


_ azi£(l  -  B(zi,z2)) _ 

(a  +  A(1  -  pzi  -  qz2))(£  +  A(1  -  pz\  -  qz2 )) 


where  B(zi,  z2)  =  b*(£,  +  A(1  —  pz\  —  qz2))  and  b*(x)  is  the  Laplace  transform  of  b  evaluated  at  x. 
The  first  term  on  the  right-hand  side  of  Q(z±,z2)  is  the  generating  function  of  the  total  number 
of  arrivals  to  the  system  when  there  is  no  server  failure  during  the  service  cycle  that  begins  the 
fundamental  server  period,  while  the  second  term  is  the  generating  function  of  the  same  quantity 
when  the  service  is  interrupted  by  a  failure  and  the  server  undergoes  a  repair.  When  q  =  1  (p  =  0), 
Q(zi,z2)  reduces  to  the  generating  function  of  (Nr,Nq)  of  [14].  Using  standard  methods,  the 
expected  number  of  arrivals  to  the  primary  queue  during  a  fundamental  server  period  is  given  by 


Pi  =  Q'Z2(  1,1) 


Ag(l-y(0)(q  +  0 

a £ 


where  Q'Z2(z\,z2)  =  dQ(z\,z2)/dz2.  In  what  follows,  it  will  be  shown  that  p\  plays  an  important 
role  in  determining  the  stability  condition  of  the  primary  queue.  Moreover,  it  can  be  shown  (see 
Lemma  1  in  [14])  that  the  equation 

^2  -  Q(zi,z2)  =  0 


has  a  unique  solution,  call  it  g(z±),  inside  the  region  | ^2 1  <  1  whenever  \z\\  <  1  or  |zjJ  <  1  and 
pi  >  1.  Whenever  z\  =  1,  g(  1)  is  the  smallest  positive  real  zero  with  g{  1)  <  1  if  p±  >  1,  and 

g(  1)  =  1  if  pi  <  1. 

Next,  we  state  the  necessary  and  sufficient  condition  for  stability  of  the  overall  system  (and 
the  orbit)  and  simultaneously  state  the  generating  functions  </>o, 0,1(21),  ^0,0(21,  22),  and  ^1,1(21,  z2). 
These  results  generalize  those  reported  in  [9,  14]. 


Theorem  1  The  queueing  system  is  stable  if  and  only  if 


=  A(l-6*(0)(a  +  Q 

ab*(£)f 


<  1. 


(1) 
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When  p  <  1,  the  generating  functions  0o,o,i (^1) ?  ^0,0(21,  z2),  and  ^1,1(21,22)  are  given  by 

1  fl  A(!  ~  9(u))  +  g  (l  ~  a+\(l-Zu-q9(U))) 


</>0,0,l(-l)  =  (1  -  p) 


a+?yexpi  @ 


Z 1 


s(«)  -  « 


and 


where 


^q,q{zx,z2)  =  £  ^0,0,l(^l). 

^2)  =  a  A^>  0o,o,i (-^i) 


du  )  ,  (2) 

(3) 

(4) 


Ao, o(zi,  z2)  =  (g(z1)-z1)[a+\(l-pzi-qg(z1))][£+\(l-pz1-qz2)][z2-B(z1,z2)-zi(l-B(z1,z2))\ 
+  Azi(l  -  6(2i,22))(22  -  2i)(l  -pzi  -  qg(zi))[a  +  £  +  A(1  -  pzi  -  qg(zi))\, 


Ai,i(zi,  z2)  =  (1  -  B(z\,z2))(z2  -  g{z\)) 

x  [(1  -  2i){a£  +  [a  +  A(1  -pz\  -  qz2)][a  +  £  +  A(1  -  pzi  -  qg(zi))]} 

+A£(1  -  pzi  -  qg(zi))(l  -  pzi  -  qz2)\  , 


AOi,z2)  =  (5(21)  -2i)[a  +  A(l  -  pzi  -qg(zi))\ 

x  (z2  -  B(zi,  z2))  [a  +  A(1  -  pzi  -  qz2)]  [£  +  A(1  -  pzi  -  qz2)\  -  a£(  1  -  B(zi,  z2))z1 


and  for  z\  £  [0, 1],  g(z\)  verifies 


9(zi) 


b*{£,  +  A(1  -  pzx  -  qg(zx)))  + 


a£zi[l  -  b*(C  +  A(1  -  pzi  -  gg(zi)))\ 

[a  +  A(1  -  pzi  -  qgizx))}^  +  A(1  -  pzx  -  qg(zt))\ ' 


Proof.  We  omit  the  proof  for  the  sake  of  brevity.  However,  the  result  can  be  shown  by 
following  the  steps  of  the  proof  of  Theorem  1  in  [14].  Note  that  the  generating  function  B(z2)  in 
[14]  is  replaced  by  -6(21,22)  above.  ■ 


If  a  Bernoulli  routing  scheme  is  not  used  (i.e.,  if  q  =  1),  all  arriving  customers  who  find  the  server 
busy  or  under  repair  will  join  the  primary  queue  by  default.  In  such  case,  the  generating  functions 
(2)— (4)  are  the  same  as  those  reported  in  Theorem  1  of  [14]. 

The  system  controller  is  faced  with  the  task  of  deciding  the  appropriate  proportion  of  arriving 
customers  to  route  to  the  retrial  orbit  when  the  server  is  busy  or  failed.  If  this  proportion  is 
too  small,  the  primary  queue  may  become  unstable.  On  the  other  hand,  if  the  proportion  is  too 
large,  a  significant  number  of  customers  will  be  denied  immediate  access  to  the  service  system  and 
asked  to  return  later  for  service.  Our  objective  is  to  determine  the  Bernoulli  routing  policy  that 
balances  this  tradeoff  and  minimizes  the  total  expected  holding  costs  per  unit  time.  To  this  end, 
we  next  characterize  the  primary  performance  parameters  (namely  the  mean  queue  lengths)  using 
the  generating  functions  of  ( R ,  Q)  and  N. 
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Proposition  1  For  p  <  1,  the  generating  function  of  (R,  Q )  is  given  by 


(5) 


and  the  generating  function  of  N  is  given  by 


(6) 


where 


ik^z)  =  [a  +  ^  +  A  —  Az]6*(£  +  A  —  A z)  {a£  +  A(1  —  z)[a  +  £  +  A  —  As]}  , 
^(z)  =  [a  +  A  —  A z\  {a£6*(£  +  A  —  A z)  —  A (z  —  &*(£  +  A  —  A*)) [a  +  £  +  A  —  Az]} 


and  Aoto(zi,  Z2),  A-i,i(zi,  zf),  and  A (21, 22)  are  defined  in  Theorem  1. 

Proof.  The  generating  function  of  (R,  Q )  is  obtained  by  summing  over  the  three  mutually 
exclusive  and  exhaustive  server  states,  i.e. , 

G(zi,Z2)  =  0o,o,i(zi)  +  0o,o(zi,  Z2)  +  0i,i (zi,  z 2) 

where  0o,o,i(zi),  0o,o(~i,  Z2),  and  0i,i(zi,  22)  are  given  by  (2),  (3),  and  (4),  respectively.  Similarly, 
the  generating  function  H(z)  is  obtained  directly  by 

H(z)  =  0o,o,i (z)  +  0o,o (z,  z)  +  z 0i, 1(2,  z). 

■ 

Remark:  When  q  =  1,  these  results  are  the  same  as  those  reported  in  Corollary  1  of  [14].  Alter¬ 
natively,  if  we  assume  that  the  server  is  reliable  (i.e.,  if  we  allow  £  j  0),  then  we  obtain  the  joint 
generating  function  of  Choi  and  Park  [6].  Next,  we  obtain  the  mean  values  of  R,  Q,  and  N  in  the 
following  proposition. 

Proposition  2  The  steady  state  mean  orbit  size,  mean  primary  queue  size,  and  mean  number  in 
system  are  respectively  given  by 


E  (R) 


1  '  A  fp  +  ap(£  +  A  p) 

1  —  p  ad 


A 


(1  -  Pl/p)  {£36*(£)  -  (a  +  £)2[£5'  -  A(1  -  &*(£))]}  +  a£26*(£)(  1  -  &*(£))(!  -  Pi) 


a£6*(£)(a  +  £)(!  -  pi) 


(7) 


£26*(g)  -  (a  +  £)[(a  +  £)R'-apb*(£)] 
a£6*(£)(a  +  £) 


(8) 
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and 


E  (IV)  = 


1  ~  P 


A  £p  +  ap(£  +  A  p) 


ad 


+- 


A  &*(£)  {£3  +  (1  -  &*(£))[a£(a  +  20  +  A(a  +  £)2]}  -  £(«  +  t)2B' 


£&*(£) 

where  B'  =  A  J0°°  xe~^xb{x)dx. 


«£&*(£)(<*  +  £) 


(9) 


Equations  (7)  and  (9)  show  that  p  <  1  is  necessary  to  ensure  the  stability  of  the  retrial  orbit  and  the 
system,  and  by  (8)  we  see  that  pi  <  1  is  necessary  for  the  stability  of  the  primary  queue.  Clearly, 
for  any  £  >  0,  p\  <  p  so  that  the  stability  of  R  depends  on  p  and  not  p\ .  Moreover,  when  £  >  0  and 
q  <  1,  some  interesting  insights  about  the  stability  of  the  primary  queue  and  the  orbit  are  revealed. 
Specifically,  there  exists  a  set  of  model  parameters  for  which  the  primary  queue  will  remain  stable 
even  if  the  orbit  is  not  stable.  The  dynamics  of  the  system  dictate  that  retrial  customers  become 
subordinate  to  primary  customers  since  they  can  regain  access  to  the  server  only  when  it  is  found 
to  be  up  and  idle.  That  is,  retrial  customers  experience  a  smaller  effective  service  rate  than  do 
primary  customers;  therefore,  the  orbit  may  continue  to  grow  while  the  primary  queue  remains 
stable.  From  a  service  quality  perspective,  it  is  desirable  to  keep  both  average  queue  lengths  small; 
however,  under  certain  conditions,  the  controller  may  choose  to  route  arrivals  only  to  the  primary 
queue,  or  only  to  the  orbit,  during  busy  or  down  periods.  Next,  using  the  results  of  Theorem  1, 
the  steady  state  distribution  of  the  server’s  status  is  obtained. 


Proposition  3  For  p  <  1,  the  steady  state  distribution  of  the  server’s  status  is  given  by 

P(Idle)  =  00,0,1(1)  =  (1  -  p)  > 

P(Failed)  =  0o,o(l,  1)  =  —^77, 

a  +  £ 

and 

P(Busy)  =  Vh,i(l,  1  )  =  p  ' 

Remark:  The  steady  state  distribution  of  the  server’s  status  is  intuitive,  i.e.,  the  probability  that 
the  server  is  busy  is  simply  the  traffic  intensity  p  scaled  by  the  long-run  proportion  of  time  that 
the  server  is  not  under  repair,  a(a  +  £)-1.  Similarly,  the  steady  state  probability  that  the  server  is 
idle  is  (1  —  p)  scaled  by  the  proportion  of  time  the  server  is  not  under  repair.  Because  our  model 
allows  for  both  active  and  idle  server  failures,  the  long-run  proportion  of  time  the  server  is  failed  is 
intuitively  given  by  £(a  +  £)_1.  It  is  noteworthy  that  the  steady  state  distribution  of  the  server’s 
status  is  insensitive  to  both  the  retrial  rate  9  and  the  Bernoulli  routing  parameter  q. 

Several  other  M/G/  1-type  retrial  models  can  be  analyzed  as  special  cases  of  ours,  and  we 
present  a  few  of  these  in  the  next  section.  We  characterize  the  optimal  Bernoulli  routing  parameter 
in  section  5. 
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4  Some  Special  Cases 


The  last  section  revealed  the  insensitivity  of  the  stability  condition  and  the  distribution  of  the 
server’s  status  to  the  parameters  6  and  q.  Interestingly,  the  stability  condition  and  server  status 
distribution  for  the  present  model  are  identical  to  those  derived  in  [14].  However,  equations  (7)-(9) 
show  that  the  retrial  rate  and  Bernoulli  routing  parameter  have  a  significant  impact  on  the  mean 
queue  lengths.  In  this  section,  we  demonstrate  that  our  model  can  be  used  to  analyze  other  M/G/l 
retrial  models.  To  this  end,  let  S  denote  an  arbitrary  (uninterrupted)  service  time  with  c.d.f.  B, 
and  for  k  >  1,  let 

Pk  =  E(Sk)  <  oo. 


Denote  by  p  the  traffic  intensity  of  the  ordinary  (non-retrial)  M/G/l  queue  with  a  perfectly  reliable 
server  so  that  p  =  A/?i,  and  suppose  that  p  <  1.  Now,  as  £  [  0  in  (7) — (9) ,  we  obtain 


E(R)  = 


A  2p 


and 


1  ~P 
E  (Q)  = 

E(N)  =  p  + 


Pi  P 2 

6  '  2(1  —  qp)\  ’ 

A  2qP2 
2(1  —  qp)  ’ 


A2 


1  -  p 


PlP  P2 

e  +  2 


(10) 


(ii) 


(12) 


Equations  (10)  and  (11)  are  identical  to  equation  (16)  of  Choi  and  Park  [6]  who  studied  a  similar 
retrial  model  with  a  primary  queue,  retrial  orbit,  and  a  reliable  server.  Now,  if  we  set  q  =  0  (p  =  1) 
in  (10)— (12),  our  model  reduces  to  the  standard  M/G/l  retrial  queue  (i.e. ,  one  which  does  not 
possess  a  primary  queue)  with  a  reliable  server,  and  the  results  are  equivalent  to  those  reported  by 
Artalejo  and  Gomez-Corral  [2]  and  Falin  and  Templeton  [8].  Specifically, 


E(R)  = 


A2 


1-p 


Pi  P2 

6  2 


and 


E(N)  =p  + 


A2 


1  ~P  . 


Pi  P2_ 

6  +  2 


Note  that  E(Q)  =  0  here  because  all  arriving  customers  who  find  the  server  busy  are  routed  to  the 
retrial  orbit  (i.e.,  there  is  no  primary  queue  to  accommodate  waiting  customers). 

Next,  consider  our  original  model  and  allow  q  — >  1.  With  q  =  1,  we  obtain  the  model  analyzed 
by  Sherman  et  al.  [14]  in  which  all  arriving  customers  who  find  the  server  busy  or  failed  join  the 
primary  queue.  Allowing  q  — >  1  in  (7)— (9)  gives  precisely  the  results  obtained  for  E(R),  E(Q),  and 
E(IV)  in  [14] .  Moreover,  if  the  uninterrupted  service  time  distribution  is  exponential  with  parameter 
p,  we  obtain  the  mean  queue  lengths  of  the  model  described  in  [13] . 

In  the  next  section,  we  consider  the  problem  of  minimizing  the  total  expected  cost  per  unit 
time  of  holding  customers  in  the  primary  queue  and  the  orbit  under  a  Bernoulli  routing  policy. 
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5  Optimal  Bernoulli  Routing  Policy 

We  now  consider  the  problem  of  determining  an  optimal  Bernoulli  routing  policy  for  the  retrial 
system  with  an  unreliable  server.  The  objective  is  to  minimize  the  total  expected  holding  costs  per 
unit  time.  Under  this  criterion,  we  provide  sufficient  conditions  to  ensure  the  existence  of  a  unique 
routing  parameter,  q* .  Denote  the  holding  cost  per  customer  per  unit  time  in  the  primary  queue 
by  cq  (0  <  cq  <  oo),  and  let  cr  (0  <  cr  <  oo)  be  the  holding  cost  per  customer  per  unit  time  in 
the  retrial  orbit.  We  consider  only  stable  systems,  i.e. ,  those  for  which  p  <  1.  Using  (7)  and  (8), 
our  optimization  problem  is  of  the  form 

min  D(q)  =  crE(7?)  +  cqK(Q)  (13a) 

s.t.  <7  G  [0, 1],  (13b) 

where  it  is  understood  that  E (R)  and  E (Q)  depend  explicitly  on  q.  The  optimal  solution  (when 
it  exists)  will  be  denoted  by  q*.  We  elucidate  the  structure  of  il(q)  and  characterize  the  optimal 
Bernoulli  routing  parameter  in  Proposition  4. 


Proposition  4  Suppose  that  p  <  1  and  Z  >  0.  Then, 

(i)  The  cost  function  d(q)  is  monotone  increasing  on  [0,1]  if 


cq  t 
—  -  1  > 


Z2b*(Z)(a  +  Z)(Z  +  ap) 


Ci?  0(1  -  p )  { eb*(0  -  («  +  £)2  [z&  -  A(1  -  b*(0)\  }  ' 

In  this  case,  the  optimal  routing  parameter  is  q*  =  0; 

(ii)  The  cost  function  d(q)  is  monotone  decreasing  on  [0,1]  if 
CQ 


_  1  <  eb*(Q(a  +  m  +  ap)(  1  -  b*(Z)p)2 

°r  e(i-p){eb*(d)-(a  +  zy 


ZB’-\(l-b*(0) 


y 


In  this  case,  the  optimal  routing  parameter  is  q*  =  1. 
(in)  The  cost  function  il(q)  is  strictly  convex  on  [0,1]  if 


Z2b*(Z)(a  +  £)(£  +  ap)(  1  -  b*(Z)pY 


<^-l 


0(1  -  p)  {e36*(0  -  (a  +  02  [zb1  -  A(1  -  b*(Z))\  }  °R 

Z2b*(Z)(a  +  f)(Z  +  ap) 


< 


(14) 


(15) 


0(1  -  p)  [eb*(Z)  -  (a  +  Z)2  [ZB1  -  A(1  -  b*(Z))\  } 

In  this  case,  q*  uniquely  solves  ’d'(q)  =  0  and  is  given  by 


•  (16) 


q  = 


,  _  I f cQ  i\  e(i-p){g3fe*(0-(«+g)2[gB'-A(i-&*(0)]} 

V  Vc«  / 


b*(Z)p 


(17) 
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Proof.  Substituting  IE (R)  and  E(Q)  from  (7)-(8),  respectively,  and  differentiating  $(g)  respect 
to  q.  we  obtain 


,_d0(9)  A  |{V({)  -  (a  +  {)2[{B'-A{1  -  &•(€))]}  \((  +  ap) 

(«)-  T:  (=0  c«)  c.C=6*{0(a  +  0(1  - 


d, 


<18) 


The  cost  function  $(q)  is  (strictly)  monotone  increasing  on  [0, 1]  if  ^'(q)  >  0  for  each  q  £  [0, 1],  and 
in  such  case  it  attains  its  minimum  at  the  boundary  point  0.  Equation  (18)  shows  that  '()' (q)  >  0  if 


(cq  -  cR) 
or  equivalently,  if 


CQ  i 
—  -  1  > 

cr 


\  {&*($- (a +  z)2[ZB,-\(i-b' 

a£2b*(f)(a  +  £)(1  -  qb*(£)p)2 

eb*(0(u  +  0(Z  +  ap) 


>  Cr 


A(£  +  ap) 
9a(  1  —  p) 


9(1  -  p)  {e3&*(0  -  (a  +  02  [&'  -  A(1  -  &*(£))]  } 


Similarly,  $(g)  is  (strictly)  monotone  decreasing  on  [0, 1]  if  $'(q)  <  0  for  each  q  £  [0, 1],  and  in  this 
case  $(g)  attains  its  minimum  at  1.  By  (18),  this  condition  is  met  if 

cq  _  1  < _ C2b*(0(«  +  £)(£  +  «P)(1  -  b*(t)P? 

CR 


9(1  -  p)  {e3&*(0  -  (a  +  02  [&  -  A(1  -  &*(£))]  } 


Finally,  we  prove  that  there  is  a  region  for  which  the  strict  convexity  of  i9(q)  is  ensured.  Differen¬ 
tiating  (18)  with  respect  to  q  yields 


r(q)  = 


d 2P(q) 

d  q2 


=  (cq  -  Cr) 


2Ap{£36*(£)  -  (a  +  02^1  -  A(1  -  &*(£))]} 
af2(a  +  0(1  -  qb*(£)p)3 


(19) 


The  denominator  of  (19)  is  strictly  positive  for  any  q  £  [0,1]  and  p  <  1  since  0  <  6*(£)  <  1  for 
any  £  >  0.  We  can  conclude  that  the  numerator  is  also  strictly  positive  since  it  was  shown  in  the 
proof  of  Lemma  2  of  [14]  that  A(1  —  &*(£))  —  £2?/  >  0  for  £  >  0.  By  rearranging  the  terms  of  (16), 
it  is  seen  that  cq  must  exceed  cr  in  this  region.  Therefore,  we  conclude  that  when  condition  (16) 
is  met,  ’d"(q)  >  0  for  each  q  £  [0, 1],  and  hence,  D(q)  is  strictly  convex  and  has  a  unique  stationary 
point  q*  satisfying  rd'(q*)  =  0.  By  (18),  this  point  is  given  by 


q 


* 


l  - 


£2&*(0(“+f)(£+“p) 

b*(Qp 


(20) 


The  advantage  of  Proposition  4  is  that  it  allows  us  to  determine  the  form  of  the  optimal  solution  by 
simply  checking  the  value  cq /cr  —  1 .  Section  6  illustrates  and  highlights  the  main  results  through 
a  few  numerical  examples. 
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6  Numerical  Examples 


In  this  section,  we  illustrate  the  behavior  of  the  cost  function  i?(g)  in  the  three  regimes  iden¬ 
tified  by  Proposition  4.  Additionally,  we  provide  a  comparison  of  the  derived  mean  performance 
parameters  (namely  the  mean  queue  lengths)  with  values  obtained  via  a  discrete-event  simula¬ 
tion  model.  For  both  illustrations  we  consider  two  absolutely  continuous  service  time  distributions 
whose  Laplace  transforms  (LTs)  are  well  defined,  namely  the  exponential  and  uniform  distributions. 
To  compute  the  mean  queue  lengths  and  the  cost  function,  we  need  expressions  for  &*(£),  the  LT 
of  the  service  time  distribution  evaluated  at  £,  and  B'  which  is  given  by 

x  exp(— £x)b(x)dx. 


Let  S  denote  an  arbitrary  (uninterrupted)  service  time.  When  S  is  distributed  exponentially  with 
rate  //,  it  is  easy  to  show  that 


and 


b*(0 


A4  +  £ 


Xfi 

(v+ar 

When  S  is  distributed  uniformly  on  the  interval  (0,  y),  0  <  y  <  oo,  we  obtain 


b*(0 


1 


[1  -  exp (-y£)]  > 


and 

B'  =  K1  “  expC-^X1  +  V0\  ■ 

Before  comparing  mean  queue  lengths  with  simulated  results,  we  first  illustrate  the  cost  func¬ 
tion,  ^(q).  For  both  distributions  we  use  the  following  parameters:  A  =  2.0,  /a  =  10.0,  £  =  1.0, 
a  =  2.0,  and  8  =  2.0.  In  the  case  of  exponential  service  times,  the  traffic  intensity  is  p  ~  0.3100. 
Figure  1  depicts  two  cases  when  the  cost  function  i?(q)  is  monotone.  In  Figure  1(a),  the  cost  pa¬ 
rameters  are  cq  =  8.0  and  cr  =  2.0  so  that  cq/cr  —  1  =  3.0,  and  the  cost  function  is  monotone 
increasing  on  [0,1]  as  dictated  by  part  (i)  of  Proposition  4.  For  Figure  1(b),  we  use  cq  =  2.5 
and  cr  =  1  so  that  cq/cr  —  1  =  1.50,  and  the  cost  function  is  monotone  decreasing  on  [0,1]  in 
accordance  with  part  (ii)  of  Proposition  4. 
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(a)  Exponential  case  (i):  i9(q)  is  monotone  increasing.  (b)  Exponential  case  (ii):  $(q)  is  monotone  decreasing. 

Figure  1:  Sample  monotone  cost  functions  when  service  time  is  exponential. 

In  case  (i),  the  optimal  Bernoulli  routing  parameter  and  corresponding  minimum  cost  are  given 
by  q*  =  0.0  and  $(q*)  ~  4.12381,  respectively.  Using  this  set  of  parameter  values  and  holding 
cost  coefficients,  it  is  optimal  to  divert  all  primary  arrivals  to  the  orbit  when  the  server  is  busy  or 
failed.  For  case  (ii),  the  optimal  Bernoulli  routing  parameter  and  corresponding  minimum  cost  are 
q*  =  1.0  and  '$((/*)  ~  1.71905,  respectively.  In  this  case,  it  is  optimal  to  admit  all  new  arrivals  to 
the  primary  queue  when  the  server  is  busy  or  failed,  despite  the  fact  that  the  holding  cost  in  the 
primary  queue  is  more  than  double  that  of  the  retrial  queue.  This  case  illustrates  the  fact  that  the 
primary  queue  can  remain  stable  even  as  the  retrial  queue  continues  to  grow  and  shows  that  the 
relative  magnitudes  of  the  cost  coefficients  are  not  the  only  determinants  of  the  optimal  Bernoulli 
routing  policy. 

Figure  2  depicts  the  cost  function  when  cq  =  3,  cr  =  1,  and  cq/cr  —  1  =  2.  The  cost  function 
is  strictly  convex  on  [0, 1]  in  accordance  with  part  (iii)  of  Proposition  4.  The  optimal  Bernoulli 
routing  parameter,  computed  by  equation  (17),  and  the  corresponding  minimum  cost  are  given  by 

q*  «  0.6450  and  0(q*)  «  1.9319, 

respectively.  In  this  case,  we  see  that  there  is  a  tradeoff  between  the  cost  coefficients  and  the  growth 
of  the  queue  lengths.  In  particular,  it  is  optimal  for  the  controller  to  divert  roughly  35%  of  the 
arrival  stream  directly  to  the  orbit  while  nearly  65%  are  admitted  to  the  primary  queue. 


14 


Figure  2:  Exponential  case  (iii):  'd(q)  is  strictly  convex. 

Next,  we  consider  the  case  when  the  service  time  S  is  uniformly  distributed  on  (0 ,y)  where  we 
assume  y  =  4///,  i.e.,  the  mean  service  time  is  twice  that  of  the  exponential  case  with  all  other 
parameter  values  unchanged.  For  this  case,  the  traffic  intensity  is  p  ~  0.63989.  Similar  graphs  can 
be  drawn  for  this  case;  however,  the  cost  coefficients  need  to  be  altered  to  conform  to  the  conditions 
of  Proposition  4. 

Figure  3  depicts  two  cases  when  the  cost  function  'd(q)  is  monotone.  In  Figure  3(a),  the  cost 
parameters  are  cq  =  30  and  cr  =  2  so  that  cq/cr  —  1  =  14,  and  the  cost  function  is  monotone 
increasing  on  [0, 1]  as  dictated  by  part  (i)  of  Proposition  4.  For  Figure  3(b),  we  use  cq  =  2  and 
cr  =  1  so  that  cq/cr  —  1  =  1,  and  the  cost  function  is  monotone  decreasing  on  [0, 1]  in  accordance 
with  part  (ii)  of  Proposition  4.  In  case  (i),  we  obtain  q*  =  0  and  $(q*)  ~  12.21908,  while  in  case  (ii), 
the  optimal  routing  parameter  is  q*  =  1  with  a  corresponding  minimum  cost  of  $(q*)  ~  3.98708. 


(a)  Uniform  case  (i):  $(<?)  is  monotone  increasing.  (b)  Uniform  case  (ii):  "&{q)  is  monotone  decreasing. 

Figure  3:  Sample  monotone  cost  functions  when  service  time  is  uniform. 

Finally,  in  Figure  4,  we  use  cq  =  3  and  cr  =  1  so  that  cq/cr  —  1  =  2,  and  the  cost  function 
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is  strictly  convex  on  [0, 1]  in  accordance  with  part  (iii)  of  Proposition  4.  The  optimal  Bernoulli 
routing  parameter,  and  corresponding  minimum  cost,  are  given  by  q*  ~  0.8400  and  i?(g*)  ~  4.93731, 
respectively. 


Figure  4:  Uniform  case  (iii):  i)(q)  is  strictly  convex. 

As  expected,  it  is  seen  that  the  optimal  Bernoulli  routing  parameter  that  minimizes  the  total 
expected  holding  costs  per  unit  time  is  on  the  interior  of  the  feasible  region. 

Next,  we  examine  the  mean  queue  lengths  as  a  function  of  the  overall  traffic  intensity,  p.  In 
the  experiments  that  follow,  we  assumed  p  =  3,  a  =  4,  £  =  0.1,  6  =  5,  q  =  0.5,  and  we  varied  A  to 
obtain  increasing  values  of  p.  Table  1  shows  a  comparison  of  the  computed  mean  queue  lengths  as 
compared  to  the  same  values  obtained  via  a  discrete-event  simulation  model  when  the  service  time 
is  exponential  with  parameter  p  =  3. 

Table  1:  Mean  queue  lengths  with  exponential  service  times. 


p 

E  (R) 

Analytical  Simulated 

E(Q) 

Analytical  Simulated 

E  (N) 

Analytical  Simulated 

0.1 

0.015922 

0.015890 

0.005898 

0.005920 

0.119381 

0.119470 

0.3 

0.140043 

0.139810 

0.052832 

0.052670 

0.485558 

0.485170 

0.5 

0.542132 

0.541680 

0.161568 

0.161440 

1.191505 

1.190800 

0.7 

1.874600 

1.868800 

0.359237 

0.358610 

2.916764 

2.910300 

0.9 

10.207577 

10.173000 

0.691578 

0.690730 

11.777204 

11.741000 

It  is  interesting  to  note  in  Table  1  that  there  is  a  dramatic  increase  in  E (R)  (and  E(1V))  when  the 
traffic  intensity  increases  from  0.7  to  0.9.  By  contrast,  the  increase  in  the  mean  primary  queue  size, 
E(Q),  is  more  moderate. 

For  the  case  of  uniformly-distributed  service  times,  the  parameter  values  are  the  same;  however, 
the  service  time  is  assumed  to  be  t/(0,  2 / p)  where  p  is  the  rate  parameter  of  the  exponential  case. 


16 


Note  that  the  mean  service  time  is  identical  to  that  of  the  exponential  case.  Table  2  summarizes 
the  steady  state  mean  queue  lengths  and  mean  number  of  customers  in  the  system.  In  this  exper- 


Table  2:  Mean  queue  lengths  with  uniform  service  times. 


p 

E  (R) 

Analytical  Simulated 

E(Q) 

Analytical  Simulated 

E(Xj 

Analytical  Simulated 

0.1 

0.013951 

0.013950 

0.004270 

0.004270 

0.115781 

0.115750 

0.3 

0.114872 

0.114950 

0.036583 

0.036580 

0.444137 

0.444220 

0.5 

0.432099 

0.432210 

0.110704 

0.110660 

1.030608 

1.030600 

0.7 

1.463517 

1.463800 

0.244981 

0.244980 

2.391425 

2.391500 

0.9 

7.824286 

7.830100 

0.470328 

0.470290 

9.172663 

9.178500 

irnent,  we  also  note  that,  as  the  overall  traffic  intensity  increases,  there  is  a  much  more  profound 
effect  on  E (R)  and  E(N )  than  on  the  mean  primary  queue  length  whose  stability  depends  on  p\ . 
Specifically,  when  p  =  0.90,  pi  ~  0.4353.  Therefore,  the  increase  in  the  overall  traffic  intensity  from 
0.7  to  0.9  results  in  only  a  moderate  increase  in  the  primary  queue  length. 
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